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Abstract 

We demonstrate and contrast two approaches to the stabilization of qubit entanglement by feed¬ 
back. Our demonstration is built on a feedback platform consisting of two superconducting qubits 
coupled to a cavity which are measured by a nearly-quantum-limited measurement chain and con¬ 
trolled by high-speed classical logic circuits. This platform is used to stabilize entanglement by 
two nominally distinct schemes: a “passive” reservoir engineering method and an “active” correc¬ 
tion based on conditional parity measurements. In view of the instrumental roles that these two 
feedback paradigms play in quantum error-correction and quantum control, we directly compare 
them on the same experimental setup. Further, we show that a second layer of feedback can be 
added to each of these schemes, which heralds the presence of a high-fidelity entangled state in real¬ 
time. This “nested” feedback brings about a marked entanglement fidelity improvement without 
sacrificing success probability. 
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INTRODUCTION 


The ability to perform quantum error correction (QEC) by feedback is a crucial step 
towards fault-tolerant quantum computation [1, 2], An open challenge, that has drawn con¬ 
siderable interest recently[3-5], is to hnd the best strategy for this task. Two nominally 
distinct feedback strategies for QEC are the measurement-based and driven-dissipative ap¬ 
proaches. The former has been more well-understood[6], owing to an existing foundation in 
classical control and feedback in engineering. In the measurement-based (MB) approach, a 
classical controller performs projective measurements of a set of multi-qubit stabilizer opera¬ 
tors that encode the logical qubit [1, 4] in order to track errors and/or perform any necessary 
correction. Thus for good performance, this approach requires both high-hdelity projective 
measurements and low-latency control electronics to process the measurement result within 
the relevant coherence times of the quantum system. The elements required for this MB 
strategy have been demonstrated for small quantum systems on various physical platforms 
such as Rydberg atoms[7], trapped ions[8, 9], photons[10], spin[ll] and superconducting 
qubits[l, 12-14, 16-21]. However, a steady-state multi-qubit QEC capability has yet to be 
achieved and one of the key questions for this development is whether the MB strategy is 
scalable to larger systems or whether an alternative approach is more optimal. 

One such alternative, driven-dissipative (DD) schemes[22], also called reservoir/bath en¬ 
gineering or coherent feedback (as discussed below), utilizes coupling between the quan¬ 
tum system of interest and a dissipative environment to transfer the entropy caused by 
decoherence-induced errors out of the quantum system. They have been demonstrated on 
a variety of physical systems including atomic ensembles[23], trapped ions[24], mechani¬ 
cal resonators[25, 26] and superconducting qubits[6, 27, 28, 30-32]. Moreover, experiments 
with trapped ions[33] and superconducting qubits [34] have demonstrated some of the ba¬ 
sic elements of autonomous QEC. These schemes do not require high-hdelity projective 
measurements, external control and its associated latency. They can also be described as 
autonomous or coherent feedback[35], where the reservoir coupled to the target quantum 
system can be considered as an effective “quantum controller” that reacts with quantum de¬ 
grees of freedom[36]. Adjusting the feedback by changing the “quantum controller”, however, 
can be more challenging than re-programming a classical controller, bnilt with conventional 
electronics. Thus, a further question is whether one can combine this DD approach and 
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the conventional MB approach with minimal negative consequences from their respective 
drawbacks. 

Here we report an experiment in which we built a feedback platform utilizing a nearly 
quantum-limited measurement chain and a customized held-programmable gate array 
(FPGA) system to perform MB and DD schemes within the same setup. The task of 
this platform was to stabilize an entangled Bell state of two superconducting transmon 
qubits[37]. This particular task of stabilizing a single state is a proxy for more general 
QEC experiments where a manifold of states is protected. We realize, for the first time, 
an MB stabilization of a Bell state by repeated active correction through conditional par¬ 
ity measurements [1, 2, 38]. We compare this scheme to a DD entanglement stabilization 
scheme [30] in which the conditional parity switch is autonomous. By performing both 
schemes on the same hardware setup and circuit QED (cQED) system[40], we shed light on 
their close connection and compare them on a level playing held. 

Previous theoretical works have compared DD (under the name of “coherent feedback”) 
and MB for linear quantum control problems[41], such as for minimizing the time required 
for qubit state puri£cation[42] or for cooling a quantum oscillator[43]. These comparisons 
showed coherent feedback to be significantly superior. In our particular setup, we find 
that distinguishing the superior approach among DD and MB is a more subtle task. The 
subtlety is two-fold. First, the performance difference depends on which process can be better 
optimized: the design of the cQED Hamiltonian or the efficiency of quantum measurement 
and classical control. In the current experiment, we show that DD has better steady- 
state performance as the cQED Hamiltonian parameters are engineered such that DD has 
a shorter feedback latency. But DD’s advantage over MB is not immutable. As certain 
experimental parameters are improved, such as coherence times and measurement efficiency, 
MB’s performance can catch up with DD. 

Secondly, in the current situation in which neither the cQED Hamiltonian parameters 
nor the measurement and control parameters are ideal, we can obtain a boosted perfor¬ 
mance by combining DD and MB to get the best of both worlds. We explored this by 
devising a heralding method to improve the performance of both stabilization approaches. 
This protocol exploits the high-fidelity measurement capability and the programmability 
of the feedback platform. The protocol is termed “nested feedback” since it has an inner 
feedback loop based on either the DD or MB scheme, and an outer loop that heralds the 
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presence of a high-fidelity entangled state in real-time. Previously, heralding schemes have 
been demonstrated for state preparation to combat photon loss or decoherence[1, 12, 44-49]. 
Extending such heralding capability to state stabilization will be a valuable addition to the 
QEC toolbox. Furthermore, the ability to herald in real time as opposed to post-selection 
is important for on-demand and deterministic quantum information processing since only 
successful events lead to subsequent processing. Real-time heralding for entanglement stabi¬ 
lization is particularly challenging for superconducting qubits due to their shorter coherence 
times compared to other systems. In this article, we implement this real-time heralding ca¬ 
pability on a time scale faster than the few microsecond coherence time of our qubit-cavity 
system. By extending the feedback platform developed primary for the MB approach to 
the DD approach, our results bring to light a new application of MB. Adding a level of 
MB feedback can significantly improve performance beyond what a single layer of feedback, 
whether DD or MB, can achieve. 


EXPERIMENT SETUP 

The simplified schematic of our experimental setup is shown in Fig. la. Two transmon 
qubits[37], Alice and Bob, are dispersively coupled to a three-dimensional aluminum cavity 
[50], with linewidth k/2t{ = 2 MHz (see Supp. Mat. Sec. I, for other parameters). The qubit- 
cavity dispersive shifts are nearly equal and in the strong dispersive regime (xAiice/27r = 
5 MHz, XBob/27r = 4.5 MHz) with photon-number resolved qubit transition frequencies[51]. 
The cavity output is amplified by a Josephson Parametric Converter (JPC) operated as a 
nearly quantum-limited phase-preserving amplifier [52] enabling rapid, single-shot readout [7] 
and thus real-time feedback. The key component of the experiment is a controller realized 
with two FPGA boards [54] that both measure and actively control the cavity-qubit system. 

An essential operation for our experiment is a two-qubit joint quasi-parity measurement 
using the common readout cavity [1, 2, 38]. As shown in Fig. lb, the cavity is driven at 
fgg (both qubits in ground state) and at /ee (both in the excited state) at the same time. 
The output at fgg and fee together distinguishes the even parity manifold {\gg ), |ee)} from 
the odd parity manifold {\ge ), \eg)}. When the two cavity output responses both have an 
amplitude below a certain threshold, the qubits are declared to be in odd parity; when either 
one has amplitude above the threshold, the qubits are declared to be in even parity. We 
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note that, unlike a true parity measurement, this readout actually distinguishes the two even 
parity states \gg) and |ee), hence we refer to it as a “quasi” parity measurement. However, 
the feedback schemes described below apply the same operation on both even states, and 
thus we need only record the parity of the measured state. The choice of driving at the 
“even” cavity resonances rather than between the “odd” resonances {feg and fge) mitigates 
the effect of the y mismatch, reducing associated measurement-induced dephasing of the 
odd manifold[2]. The controller FPGA a (b) modulates the fgg {fee) drive to the cavity and 
also demodulates the response. The two FPGAs share their measurements of the cavity 
response to jointly determine the parity. In addition, FPGA a and b generate the qubit 
pulses to Alice and Bob, respectively, which are conditioned on the joint state estimation 
during real-time feedback. 


PRINCIPLE OF EXPERIMENT AND RESULTS 

We hrst briefly outline the DD stabilization of entanglement, described in detail in Ref. 6 
and 30. This stabilization targets the two-qubit Bell state |0_) = — \eg)). Figure 2a 

displays the states coupled by the autonomous feedback loop. Two Rabi drives on Alice and 
Bob at their zero-photon qubit frequencies (i^Aiice '^Bob; Supp. Mat. Sec. I) couple 
the wrong Bell state 10+) to the even states, \gg), |ee), in the energy manifold with zero 
cavity photons. A second pair of Rabi drives at the n-photon qubit frequencies (cuAUce ~ 
and — ny, x = (XAiice + XBob) /2), with their relative phase opposite to the first pair, 
couple \gg,n), \ee,n) to the Bell state |0_,n). The two cavity drives, at fgg and fee connect 
the two manifolds and hence the combined action of the six drives transfers the population 
from \gg), |ee) and |0+) to |0_,n). Finally, cavity photon decay brings |0-,n) back to 
|0_,O). In effect, the cavity drives separate qubit states based on their parity, allowing one 
pair of Rabi drives to move the erroneous odd population to the even states while the other 
pair transfers the even states population to |0-). 

Gounterparts to these elements of the DD feedback loop can be found in the corresponding 
MB feedback scheme. The action of our MB algorithm is shown as a state machine in Fig. 2. 
We describe the quasi-parity measurement P by the projectors Podd = Ifi'e) {ge\ -t- \eg) {eg\, 
Pgg = \gg) {gg\ and Pee = |ee) (ee|. We assign the outcomes p = -|-1 to the even projectors, 
Pgg and Pee and p = —1 to Podd- The MB algorithm is built with a sequence of correction 
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steps, each of which consists of a conditional unitary and a quasi-parity measurement. The 
two possible states of the state machine correspond to whether we apply the unitary Ue or 
Uo, followed by the quasi-parity measurement. Specihcally, Ue = -Rx(f) ® where 

a (b) denotes Alice (Bob), and Uq = -Rx(f) ® ^ correction step k, the qubits 

are initially in either \gg), |ee) or in the odd manifold, due to the projective quasi-parity 
measurement in step k — 1] the controller then applies Ue (Uq) if p in previous step reported 
+1 (- 1 ). 

The effect of the state machine on the two-qubit states is shown in Tab. I, where the action 
of the controller during one correction step is described in terms of the four basis states, 
|0_), I0+), \gg) and |ee) (the latter two are grouped in the ’’even” column). The quasi-parity 
measurement inhdelity, labeled by eE\o {.^o\e)i gives the error probability of obtaining an 
even (odd) parity outcome after generating an odd (even) state. Because these measurement 
inhdelities are small, the dominant events are those that occur without measurement errors. 
At each step, Ue on either \gg) or |ee) followed by the quasi-parity measurement P transfers 
the states to |0_) with 50% probability. Since |0_) is an eigenstate of Uq and P (modulo a 
deterministic phase shift that can be undone, see later discussion), these operations leave it 
unaffected. On the other hand, Uq and P transform 10+) into {\gg) , |ee)}; more generally, 
they take population in any other odd state (i.e., a superposition of |0_) and |0+) ) into 
|0_) and the even states. 

By repeating a sufficient number of these correction steps in sequence, the controller 
stabilizes the target Bell state irrespective of the initial two-qubit state. The similarity be¬ 
tween this active feedback and DD is that MB also transfers population between different 
parity states by conditional Rabi drives. However, while the Rabi drives in DD are condi¬ 
tioned autonomously by the photon number in the cavity, the unitary Rabi pulses in MB 
are conditioned by real-time parity measurement performed by active monitoring of cavity 
outputs. 

The pulse sequences for DD and MB are shown in Fig. 2b and e. In DD, a set of 
continuous-wave drives are applied for a fixed time Tg and after some delay T„ to allow 
remaining cavity photons to decay, a two-qubit state tomography is performed [55, 56]. 
The cavity and Rabi drive amplitudes and phases were tuned for maximum entanglement 
hdelity, following the procedure described in Ref. 30. In particular, the optimal cavity drive 
amplitudes were found to be h = 4.0. For MB, the continuous drives are replaced by a pre- 
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TABLE I. Effects of the MB finite state machine of Eig. 2 on two-qubit system in the fc-th step of 
feedback, for different starting cases (columns). Row 2 through 4 describe the result of the previous 
quasi-parity measurement and the corresponding unitary that will be applied in the k-th. step. The 
symbols, e^o eo\E-, denote parity measurement errors (see text). The last row describes the 
possible system states attained by the applied unitary. The two alternative states for a previous 
“even” occur with 50% probability. 

defined number of correction steps iV, resulting in a stabilization duration of T* = NTgtep 
where Tgtep = 1-5 Rs. There is no extra delay before tomography since each correction 
step already contains a delay after the quasi-parity measurement due to feedback decision 
latency. The strength and duration of the quasi-parity measurement P were optimized 
as discussed in Supp. Mat. Sec. II. The optimization achieved low parity measurement 
inhdelities eE\o and eo\E while keeping the measurement-induced dephasing arising from 
the X mismatch[2, 38] small compared to the natural decoherence in the same duration. 
We experimentally determined the inhdelity of the quasi-parity measurement to be eE\o aiid 
eo\E of 0.04 and 0.05, respectively. The quasi-parity measurement also causes a deterministic 
qubit rotation about the respective Z axis due to an AC Stark shift [2]; this rotation was 
corrected within the unitary gate Uq as discussed in Supp. Mat. Sec. VI. 

Fig. 2c,f show the hdelity to the target Bell state 10-) as a function of stabilization time 
for DD and MB, respectively. The hdelity rises exponentially with a characteristic time 
constant of 0.78 fis (1.4 /is) and a steady-state hdelity of 76% (57%) for DD (MB). Both 
hdelity values agree with numerical modeling based on master equation simulation, which 
gives 76% and 58% for DD and MB, respectively (see Supp. Mat. Sec. IV). The exponential 
dependence is a signature of feedback and arises from the characteristic loop time. The 
experimentally determined time constants are in reasonable agreement with their simulated 
values of 1.0 /rs (1.4 /rs) for DD (MB). In MB, this loop time is related to the step length 
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(1.5 fj,s), which is given by the sum of the quasi-parity measurement duration (0.66 ps), 
the cable, instrument and FPGA latencies (0.69 ps), and the duration of unitary pulses 
(0.15 fis). On the other hand for DD, the measured loop time is close to 10 cavity lifetimes, 
the expected time as shown in Ref. 6. 

The superior performance of DD over MB for the steady-state fidelity is due to the 
difference in correction loop time, which needs to be shorter than the coherence times of 
the two qubits for high fidelity entanglement. For the current experimental setup, the 
latency of the controller and quantum efficiency of the measurement chain, which affects 
the fidelity of the single-shot readout, result in a longer loop time in MB. A source of the 
longer feedback loop time is the quasi-parity measurement duration. This measurement 
duration, which was optimized as discussed in Supp. Mat. Sec. II, is limited by dephasing 
induced by the mismatch in y (~ 10%) and the measurement efficiency of the output 
chain (~ 30%), which can both be improved in future experiments. Our simulations (see 
Supp. Mat. Sec. IV) suggest that with current state-of-the-art measurement efficiency value 
and optimization of the FPGA/cable latency, the MB steady state fidelity can be improved 
to 66%. The limited measurement efficiency does not affect the performance of DD because 
the parity measurement and correction take place autonomously within the qubit-cavity 
system, indicative of its robustness against this hardware limitation. On the other hand, 
both DD and MB schemes benefit from longer intrinsic coherence times and reduction of 
the X mismatch. For example, simulations show that if the coherence times are improved to 
one hundred microseconds (achieved in other state-of-the-art cQED setups), both DD and 
MB fidelities can increase to above 85%. For the rest of the article, however, we consider 
boosting the fidelity in a different manner, without making any physical changes to the 
qubit-cavity system. 

The DD and MB schemes described so far are synchronous in the sense that the stabiliza¬ 
tion always ends after a pre-determined duration and the tomography follows. Decoherence 
causes the qubits to have a finite probability of jumping out of the target state immediately 
before the stabilization terminates. Hence, this “fixed time” protocol does not always out¬ 
put the target state with maximum fidelity. An optimum protocol would rather utilize all 
available information. For both DD and MB, we can measure the cavity output at the end 
of the stabilization period. The outcomes of these measurements, Igg and lee, give real-time 
information on the state of the two qubits, and thus can herald a successful stabilization 



sequence. 

In Fig. 3, we describe how monitoring the cavity outputs improves target state hdelity. 
We introduce two thresholds j (see supplementary material for details) to 

post-select the measurement outcomes of Igg and lee respectively, and identify successful 
stabilization runs [1, 30]. The results of varying are shown in Fig. 3b,d for 

DD and MB, respectively. The color plots show hdelity improving as the thresholds become 
more stringent. The success probability dehned as the percentage of stabilization runs kept 
for tomography given a set of thresholds, is also plotted as contours for both DD and MB. 
There is a clear trade-off between success probability and hdelity. To reach the maximum 
hdelity in DD of 82%, at least 75% of experiment runs need to be discarded. The trade-oh 
is less severe in MB, where only 50% of runs need to be discarded to reach the maximum 
hdelity of 75%. However we aim to eliminate this trade-oh all together, i.e., to improve the 
hdelity while maintaining a high success probability. 

This goal is achieved by introducing a nested feedback protocol (NFP), in which the 
stabilization feedback loop enters into a higher layer of feedback for “hdelity boosting” 
instead of proceeding to state tomography directly. In contrast to the “hxed time” protocol, 
NFP conditions the termination of stabilization on the quality of the entanglement, i.e., it 
heralds a successful stabilization run in real-time, as illustrated by the state machine diagram 
in Fig. 4a. The control variable C is given by C = {Igg < AND {lee < where 

^iheraid^ jheraid^ determined by the same post-selection experiment discussed previously 
to optimize the hdelity (black square in Fig. 3b and d). If the controller determines that the 
entanglement quality is not sufficient (C = 0), a boost phase is attempted which comprises 
exactly one correction step for MB or a stabilization period of similar duration for DD 
(1.4 /is). During the boost phase, the cavity outputs are integrated to give {Igg, lee} which 
enables the next real-time assessment of C. In DD, the parity measurement and hrst layer 
of feedback is accomplished autonomously, therefore the FPGA only needs to check C. 
However in the MB scheme both layers of feedback are performed solely by the FPGA. It 
therefore checks if G = 1 to herald that the entanglement meets the desired quality. If not, it 
uses the quasi-parity thresholds (grey circles in Fig. 3d) to decide whether the qubits are in 
even or odd state in order to continue stabilization. This asynchronous pulse sequencing and 
conditioning by multiple thresholds exploit the programmable nature of the FPGA-based 
platform. 
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The asynchronous behavior of NFP is displayed in Fig. 4b(e) for DD (MB), which demon¬ 
strates 200 single-shot runs. The DD (MB) hdelity boosting sequence continues until either 
success or a maximum limit on boost attempts (set to 11 in the experiment) is reached. For 
the MB protocol, the trajectory of the qubits’ parity can be tracked by the conditioning 
outcomes of the inner-loop control variable p and the outer-loop control variable C, which 
are independent. Through repeated boost attempts until success, NFP signihcantly im¬ 
proves the overall success probability. Within 11 attempts, 95% (99.8%) of DD (MB) runs 
satisfy the success condition compared to just 25% (50%) with simple post-selection. This 
is assessed by the cumulative probability, the integral of the probability of having completed 
a certain number of boost attempts before tomography, as plotted in Fig. 4c,f. Since MB 
requires a less stringent threshold than DD to gain fidelity improvement, the MB success 
probability converges to unity much faster than that of DD. Finally, we show that the high 
success probability does not come at the cost of reduced hdelity. The hdelity to |(;/)_) for DD 
improves from an unconditioned value of 76% to 82% (averaged over all successful attempts). 
For MB, the improvement is more pronounced: hdelity rises from an unconditioned value 
of 57% to 74%. Thus for both DD and MB, NFP attain close to the hdelity achieved via 
stringent post-selection. These results for NFP also agree well with a numerical simulation 
(see Supp. Mat. Sec. V). 

One will note, however, a continuous downward trend of the hdelity in both DD and MB 
schemes as the number of attempts increases. This is due to the non-negligible population in 
the I/) states of the two qubits in the experiment, which escape correction by the stabilization 
feedback loops. After each further boost attempt of stabilization, the probability of the 
population escaping outside the correction space thus increases, diminishing the hdelity (see 
Supp. Mat. Sec. VII). Also note that the error bars on the hdelity of MB are bigger than 
those in DD for large attempt numbers simply because the probability of needing many 
attempts is lower in MB than in DD. 

While real-time heralding by NFP removes the trade-oh between hdelity and success 
probability, it does so by introducing a diherent trade-oh - high hdelity and success proba¬ 
bility are achieved but the protocol length now varies from run to run. If NFP is a module 
within a larger quantum information processing (QIP) algorithm, then this asynchronous 
nature must be accommodated by the controller. For our FPGA-based control, NFP is easily 
accommodated because it is a natural extension to “hxed-time” or synchronous operation. 
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In “fixed time” operation, the controller conditions its state by the protocol length which is 
pre-determined and stored in an internal counter by the experimenter. On the other hand in 
NFP, the controller conditions its state on a pre-determined logical function of its real-time 
inputs. 


CONCLUSION 

In conclusion, we have implemented a new MB stabilization of an entangled state of two 
qubits, which parallels a previous DD stabilization scheme. Instead of coherent feedback 
by reservoir engineering, MB relies on actively controlled feedback by classical high-speed 
electronics external to the quantum system. When comparing both schemes in the “hxed- 
time” protocol, we observe that DD gives a higher hdelity to the target state due to lower 
feedback latency. Furthermore, we have improved the fidelity of both schemes by a nested 
feedback protocol which heralds stabilization runs with high-quality entanglement in real 
time. The real time heralding brings about the hdelity improvement without a common 
trade-off in QIP: it does not sacrihce the experiment success probability. It eliminates this 
trade-off by allowing asynchronicity in the experiment. 

Our experiment shows some of the key advantages of MB platforms that have not been 
previously explored. Typically, the performance of MB feedback has not been at par with 
methods based on post-selection, due to the latency of the controller. However it is widely 
recognized that the trade-off of success probability for hdelity in the case of post-selection 
is untenable for large scale systems. Therefore, existing digital feedback[l, 12, 14, 16] have 
focused on achieving nearly perfect success probability. Here, we are exploring another 
direction of feedback which achieves high hdelity with high success probability. Our nested 
feedback strategy maximizes the use of the information coming out of the qubit-cavity 
system in order to make the correction process as efficient as possible. We hnd that our 
feedback platform, comprised of a nearly-quantum-limited measurement chain and a real¬ 
time classical controller, provides the necessary tool-set to implement such a strategy. We 
show that this technology can be extended to improve the performance of DD approaches as 
well as single-layer MB approaches themselves. This strategy could be carried out further 
in the future. For example, the FPGA state estimator could perform a more sophisticated 
quantum hlter of the microwave output of the DD stabilization to herald successful events 
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with better accuracy, significantly improving the success probability convergence rate. 


Similar ideas can be applied in the future towards other forms of stabilization, such as 
for stabilizing Schrodinger cat states of a cavity mode[57], a proposed logical qubit. Ini¬ 
tial experiments on such logical qubits with high fidelity-measurement [58] or dissipation 
engineering[31] have been performed and could now be combined. Likewise, future logi¬ 
cal qubits based on the surface code [4] could also be stabilized by either active stabilizer 
measurements[17-19] or as recently proposed by dissipation engineering[5, 59]. Our exper¬ 
iment demonstrates that measurement-based and driven-dissipative approaches, far from 
being antagonistic, can be merged to perform better than either approach on its own. 
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(b) Transmission 



FIG. 1. (a) Schematic of the experimental set-up. Two independently addressable transmon qubits, 
Alice and Bob, are dispersively coupled to a three-dimensional microwave cavity. The cavity output 
is directed to a nearly quantum-limited measurement chain consisting of a Josephson amplifier 
(JPC) followed by a semiconductor amplifier (HEMT). A pair of custom Field-Programmable 
Gate Array boards (FPGA a, b) monitor the amplified output and generate real-time modulated 
microwave drives to control the cavity-qubit system, (b) Transmission spectra of the cavity. Cavity 
outputs at fgg and fee are fed to FPGA a and b, respectively. 
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FIG. 2. Comparison between the driven-dissipative (DD) and the measurement-based 
(MB) entanglement stabilization, shown in the left (DD) and right (MB) panels, (a) Diagram 
of qubit/cavity state evolution in the DD feedback loop. Two-qubit state manifolds are 
laddered for different photon numbers in the cavity (labeled by |n)). The green and pink 
sinusoidal double arrows represent cavity drives, while the straight double red/blue and 
cyan/yellow arrows are Rabi drives on the qubits. These six drives and the cavity dissipation 
(black decaying arrow) couple the different states of the system such that the target state 
|(;/)_) is stabilized, (b) Functional pulse sequence for DD. The duration needed to empty 
the cavity of residual photons (see text) before tomography is indicated by T^. (c) Fidelity 
to the target as a function of stabilization duration (Tg). Dashed line at 0.5 denotes the 
threshold for entanglement. The time given in the white box, r, is the characteristic time 
constant of the exponential rise of hdelity. (d) State machine representation of the MB 
feedback loop. The quasi-parity measurement reports \gg)i \ee) as p = -|-1 (even) and |0-), 
10+) as p = —1 (odd). For odd (even) parity, two | pulses, with identical (opposite) phases, 
are applied to Alice and Bob, respectively, (e) Sequence of correction steps conditioned by 
the quasi-parity measurement and leading into tomography. Counter k limits the number of 
steps to N. (f) Fidelity to the target Bell state as a function of stabilization duration {Ts) 
or number of correction steps (iV). 


19 




013457 013457 
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FIG. 3. Trade-off between success probability and fidelity for both DD and MB schemes, (a) Pulse 
sequence for heralding DD stabilization by post-selection. At the end of the stabilization period, 
the cavity outputs, Igg and lee are measured at their respective frequencies, (b) Color plot of fidelity 
to the target state for DD as a function of thresholds chosen for Igg and lee (see supplementary 
material for details of the thresholds). Also plotted as white dashes are contour lines of the success 
probability associated with each choice. Solid black square indicates the thresholds chosen for the 
condition C in the nested feedback protocol described in Fig. 4. (c) and (d) same as above for MB, 
including the corresponding thresholds for C. Gray circle indicates the thresholds chosen for the 
quasi-parity measurement p used to condition MB correction steps. 
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FIG. 4. Nested feedback protocol for boosting fidelity and heralding successful stabilization run 
in real-time, (a) State machine representation. The control variable C (see main text and Fig. 3) 
determines the repetition of a boost cycle or the heralding of a successful run. The maximum 
number of boost attempts allowed is set to 11 in the experiment, (b), (e) 200 single-shot sequence 
trajectories of nested feedback for DD and MB, respectively. The trajectories are colored yellow 
during boost attempts and black after C is satisfied. Trajectories that are entirely black satisfied 
the success criterion without the need for boost. The inset in (e) shows an example of an MB 
trajectory consisting of both p and C outcomes. (c),(f) Cumulative success probability (gray 
shade) of having completed at most a given number of boost attempts before tomography for 
DD and MB, respectively. Yellow bars indicate differential success probability. (d),(g) Fidelity to 
target Bell state for DD and MB, respectively. Green squares show the corresponding fidelity as 
a function of the number of boost attempts. The blue dashed line denotes fidelity without any 
boosting (i.e. unconditioned by C). Blue arrows denote the improvement in fidelity due to nested 
feedback. While the absolute fidelity of MB is worse than DD due to latency, the fidelity boost is 
higher. 21 































































Supplementary materials 


I. SYSTEM SETUP 

The cavity-qubit system was housed in an Oxford Triton 200 dilution refrigerator, at 
a base temperature below 20 mK. The setup of the experiment is described in detail in 
Fig. S.l. The key featnre is that both data acqnisition and arbitrary waveform/digital 
marker generation were accomplished by two FPGA boards programmed with a customized 

logic. 

The cavity frequency is fgg = 7.5 GHz when both qubits are in the ground state and 
its linewidth K/27r is 2 MHz. The qubit frequencies with no cavity photons are a;^[j^g/27r = 
4.87 GHz and = 6.18 GHz, for Alice and Bob respectively. The two dispersive 

shifts are XAiice/27r = 5 MHz, XBob/27r = 4.5 MHz. The anharmonicity for the two qnbits 
are CAiice/27r = 212 MHz, CBob/27r = 209 MHz. Alice (Bob) has a Ti of 60 /rs (18 /rs), 
^ 2 ,Ramsey of 9 /is (10 yUs) and excited state popnlation in \eg) and \ge) of 5% each. 

As mentioned in the main text, the fast dispersive single-shot readont used in the exper¬ 
iment is made possible by a nearly quantnm-limited phase-preserving measnrement chain 
based on the Josephson Parametric Gonverter(JPG). The JPG was operated with a gain of 
20 dB and bandwidth of 6.2 MHz, with the freqnency for maximum gain centered between 
the two readont freqnencies, fgg and fee- The gain and noise visibility ratio at both fgg and 
fee were about 17 dB and 6 dB, respectively. 


II. MEASUREMENT STRENGTH AND DURATION CALIBRATION FOR MB 

The qnasi-parity measnrement strength and dnration were optimized in order to maximize 
the hdelity of MB. This optimization was done by maximizing the fidelity of the Bell state 
created in a calibration experiment, similar to Ref. 1. The qnbits are prepared in gronnd 
states (by post-selection) and then two 7r/2 pulses, are applied to Alice and Bob, producing 
the state |'^) = l{\gg) + |ee) -I- \ge) + \eg)). The quasi-parity measurement, consisting 
of the two cavity drives on fee and fgg respectively, projects the qnbits into one of the 
two even states or entangles the qnbits into a Bell state with odd parity. We varied the 
dnration of this parity measurement and its strength in terms of photon nnmber (set to be 
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identical) for each readout frequency to find the parameters that maximize the hdelity of 
the entangled state to the closest Bell state (Fig. S.2). The Bell state hdelity would ideally 
increase and asymptotically approach one with increasing measurement time as the parity 
measurement better distinguishes the odd Bell state from the even states. On the other 
hand, at long measurement duration, the coherence of the entangled state decreases due to 
both natural and measurement-induced dephasing, the latter of which is caused by the y 
mismatch between the qubits and is proportional to the average number of photons used for 
the measurement [2]. Therefore, there is an optimal measurement strength and duration. 
For our experiment, h = 4.5 for each readout frequency and a measurement duration of 
660 ns are found to be close to the optimal values and are chosen to attain a Bell state 
hdelity of 80%. 

The value of 80% sets the upper bound on the hdelity that we should expect for heralding 
MB. In the actual MB experiment, an extra 310 ns delay was introduced after the quasi¬ 
parity measurement in a correction step, which does not occur in the sequence described 
in this section for optimizing the parity measurement parameters. This extra delay was 
required to accommodate the feedback latency in MB. The conditioned hdelity we obtained 
for heralded MB is about 6% lower. 


III. MEASUREMENT OUTCOMES DISTRIBUTION FOR DD AND MB 

The cavity outputs at fgg and fee for both DD and MB can be used to monitor the state 
of the qubits during stabilization (Fig. S.3a). Histograms of the measurement outcomes Igg 
and lee, recorded by integrating the cavity output at fgg and fee, respectively, are shown in 
Fig. S.3b,c,d,e for DD and MB respectively. In DD, the cavity output signals are captured 
while all the CW drives are still on, i.e., the qubits are being driven while their states are 
being monitored, whereas in MB the outputs are a result of the quasi-parity measurements 
which occur after the qubit pulses and thus when the qubits are not driven. We observe that 
the measurement outcome distribution of DD lacks the separation seen in MB which has a 
clear parity separatrix This feature also appears in numerical simulations 

of DD by the stochastic master equation [3]. The state estimation used in both DD and MB 
uses the “box car” hltering [4] which simply sums up the recorded cavity output signals over 
time to obtain the measurement outcomes. This method, while appropriate for MB, is not 
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suited for DD since in the latter, the qubits are undergoing actively-driven dynamics when 
the measurement is taking place. A more advanced hlter, such as a non-linear quantum hlter 
can be designed from either first-principles or machine learning [5] in the future to improve 
the state discrimination accuracy in DD. 

The measurement outcomes to the left of both the (shown in hgure) and 

thresholds are much less likely to come from even states than those to the right. Therefore 
the experiment runs with these outcomes are selected for state tomography, giving the 
results plotted as a color map in Fig. 3 (main text). Moving the threshold further to the left 
increases the stringency of the threshold as fewer measurement outcomes are included. The 
success probability for each threshold choice (plotted as contours in Fig. 3) is calculated by 
the ratio of included outcomes to the total number of experiment runs. 


IV. STEADY STATE MODEL FOR DD AND MB 

The steady state behavior of both DD and MB is simulated by a Lindblad master equa¬ 
tion, given by 

=-^[H{t),p{t)] + KD[a]p{t)+ ) 

j=A,B J 

(S.l) 

D is the Lindblad super-operator, dehned for an operator O as D[0]p = OpO"'— {1/2)0''Op— 
{l/2)p0'^0. The pure dephasing rate for Alice and Bob, respectively, is given by 1/T^’^ = 
1 /T^,b _ where + 1/Tf’^. 

The Hamiltonian H{t) is treated differently in DD and MB. For DD, the Hamiltonian is 
described in detail in the theory proposal [6] and parameters in the Hamiltonian, such as the 
cavity and qubit drive amplitudes, are swept in simulation to hnd the optimal values. The 
optimal value for the cavity drive amplitude is found to be K\/fi/2 with n = 4.0, and k/2 for 
the qubit drive amplitudes at both zero-photon and n-photon qubit frequencies. The DD 
simulation predicts a characteristic time constant of 1 /is and a steady state fidelity of 76% 
(accounting for the delay between stabilization and state tomography to allow remaining 
cavity photons to decay). 

For MB, a correction step is broken into four segments for effectively piecewise master 
equation simulation. The hrst part contains the conditional Rabi pulses which are simulated 
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as perfect instantaneous unitary operations on the qubits. The second part is the decay 
during the pulses (154 ns total). The Hamiltonian during this part is just the the dispersive 
interaction between the qubits and the cavity, H(t) = H^isp = (xaO'^/2 + Xb^^z 
the rotating frame of the two qubits and the cavity mode + a;“)/2). The 

third part is the quasi-parity measurement during which the cavity drives at the fgg and fee 
resonances are on and the Hamiltonian is given by H{t) = -|-2ecCos {{xa + XB)t/2) {a + 

al), where Cc is the amplitude of the cavity drive (660 ns total). The last part is the remainder 
of the correction step, incurred by the latency of the feedback during which all drives are off 
and the qubit-cavity system is in free-decay. The dynamics in this part is again simulated 
by the dispersive interaction, H(t) = Hdisp (686 ns total). For the piecewise master equation 
simulation of a complete correction step as four segments, the density matrix at the end of 
a segment is used as the initial density matrix for the next segment. 

Since in MB, the state at the end of a correction step depends only on the initial state 
at the beginning of the step, we can model the MB scheme as a Markov chain (Fig. S.4). 
In the rest of this section, we show how we derive the transition matrix that describes this 
Markov chain. 

As discussed in the main text, we can describe the qubits by the density matrix p = 
71- |0_) (0_| -|- 7r+ 10+) (0+1 -f- Tigg \gg) {gg\ -|- Ties |ee) (ee|. Therefore, in terms of probability 
distributions in the four basis states, the qubits’s state, S, can be represented by a vector. 



If the qubits are prepared in |0-), that is = (1, 0,0, 0)"*", we can calculate after a 
correction step by applying the master equation simulation method described above. We 
need to consider the two possible cases where the conditional unitary applied is Uq or Ue, 
respectively. The S^_ is a weighted average of the two cases. 

•sif = (1 - fElo) * (S-3) 

eE\o and eo\E are the quasi-parity measurement inhdelities due to limited measurement 

~ ( f) 

efficiency, introduced in the main text. In a similar manner, we can obtain In the case 
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of an even initial state, for example, Sg^g = (0, 0,1, 0)"'", we have 

= (1 - eo,E) » + eoiE * (S,4) 

And similarly for See ■ 

Given S^_, S^_^, Sgg and See\ we can construct the transition matrix T of a correction 
step, 

q-— (qif) cif) c{f) Qif)] ^ii 

' ~ W4>-^if >+1 ^gg ^^ee ) 

where the are the columns of the 4 by 4 matrix. Now applying this transition matrix 

on any arbitrary initial state gives the final state after a correction step, 

SU) = (S.6) 


The transition matrix T is also called the stochastic matrix, with the property that each 
column sums to 1. One of T’s eigenvalues is guaranteed to be 1 and the corresponding 
eigenvector, Soo, is the steady state of the Markov chain. It can easily be shown that for 
any arbitrary initial state, 

lim = Aoo (S.7) 

k^oo 

For the given experimental parameters in MB, the T matrix is displayed in Fig. S.4 and we 
find the steady state eigenvector to be 


^oo = 


/0.58\ 

0.11 

0.18 

V0.13/ 


(S.8) 


Thus, the Markov model predicts a steady state hdelity of 58% to |0-). Taking into account 
of the duration of a correction step (1.5 /xs), we can also calculate the characteristic time 
constant of the MB scheme from the model, which gives 1.4 /is. Both values agree very well 
with experimental results. 

We can calculate the expected hdelity when some of the experimental parameters are 
improved in the near future. If the measurement efficiency is improved from 30% to the 
current state-of-the-art value of 60%, the measurement duration can be reduced by half 
while maintaining the quasi-parity measurement in£delities[7]. The instrument and FPGA 
latencies incurred in the experiment can also be reduced by 100 ns, in the latest hardware 
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Steady state fidelity 




DD 

MB 

Experiment 

current parameters 

76% 

57% 

Simulation 

r] = 0.6, latency = 586 ns 

76% 

66% 


Ti,T 2 = 100 fis 

86% 

86% 


TABLE S.l. Listing of the steady state fidelities of the “fixed time” protocol for DD and MB 
schemes for the current experiment (see detailed parameters in Supp. Sec. ) and simulation with 
improved parameters, assuming y values as in the current experiment. The prospective values in the 
last row also take into account the parameter changes in the second row. Note that measurement 
efficiency and control latency change do not affect DD. 

setup and FPGA logic design in operation while this article was being prepared. The 
measurement duration and control latency reduction can shorten the correction step length 
to 1 /rs, which can improve the steady state fidelity to 66%. Furthermore, if the coherence 
times are also improved to the state-of-the-art values in the hundred of microseconds for 
superconducting qubits, both DD and MB fidelities in the “fixed time” protocol can be 
above 85%, limited by the y mismatch (assumed to be 10%, as in the current experiment). 
The prospects of both DD and MB schemes are summarized in Tab. S.l 


V. SIMULATION OF REAL-TIME HERALDING BY NESTED FEEDBACK PRO¬ 
TOCOL FOR DD AND MB 

The Markov chain model introduced in Sec. Ill can be extended to simulate NFP 
(nested feedback protocol). We can construct a “nested” Markov chain (Fig. S.5) for each 
boost attempt of NFP. At the outer-most level, there are two nodes. One node denotes the 
trajectories that have just been heralded as successful; the other node denotes the trajectories 
that require at least another boost attempt. Building on the vector description established 
in the previous section, we represent the heralding of trajectories before a boost attempt by 
a diagonal matrix c, 
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c = 




C0+ 


-'aa 


V 


(S.9) 


Cee / 


After stabilization of some pre-determined duration, if the qubits are (on average) in 
state then the average state of the qubits that are heralded is then given by, 


c(o) 

^herald 




(S.IO) 


where || ■ ||i is the Li norm of the vector (the sum of the entries in the vector). The 
normalization is required since heralding selects only a subset of the trajectories, i.e., the 
matrix, c, does not preserve the norm of S. 

Each entry on the diagonal of c gives the fraction of trajectories that get heralded (se¬ 
lected) from a particular state. Their values depend on the particular heralding thresholds 
used, and (Supp. Sec. II). This matrix can be determined phenomenologically 

by the post-selection experiment shown in Fig. 3 (main text). From the experiment, we can 
hnd the average state without any conditioning (no heralding), = (vri^^ Trle^)"'" 

and the average state of the heralded trajectories, Given the 

success probability Pg of using the thresholds (the white dashed contour line of Fig. 3b,d in 
the main text), the diagonal elements can be calculated as. 


tt ' Pg 


Tl'P. 








do) 




tt' P 
aa^^ 

^33 


Cpp - 


KePs 

( 0 ) 

'Kee 


(S.ll) 

TTT' TTj 

For the specihc heralding thresholds used in the experiment (represented by the black 
squares in Fig. 3b,d in the main text), cdd and cmb are explicitly given by. 



/0.26 

\ 


/0.68 

\ 

Cdd — 

0.20 

0.19 

, Cmb — 


0.69 

0.19 


V 

O.lSyi 


V 

0.10/ 


(S.12) 


Ideally for c, only c^_ should be non-zero. But in practice, since we cannot distinguish 
10-) and I0+), is comparable to c^_. Furthermore, for both DD and MB, Cgg and c^e 
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are also non-neglibile. In DD, this is predominantly due to the lack of separation between 
the even and odd measurement outcomes as discussed in Supp. Sec. II. In MB, the qubits 
can jump during the delay between the completion of the quasi-parity measurement and the 
end of a correction step due to Ti events. Thus for MB, trajectories that are heralded by 
very stringent thresholds still have a non-zero probability of being in the even parity states. 

Given the heralding matrices cdd and cmb, we can now calculate the average state of the 
trajectories that are not heralded and thus require a boost attempt as 


( 0 ) 


5(0) _ (_p^\ c(0) c(0) _ ^boost 

'^boost \ j ^boost II II 


(S.13) 


where we introduce the lowercase as the unnormalized population distribution vector. 
II ■ 111 denotes the Li norm of the numerator. 

After this boost attempt, the qubits are in state 


g(i) _ ^ 

where T is the stochastic transition matrix that models the stabilization during a boosting 
attempt. In Supp. Sec. Ill, we have already found T for MB. By the same method, we 
can also derive the effective transition matrix of a boost attempt for DD. The calculation 
of T for DD is an approximation: due to the continuous cavity drives, the state of the 
qubits at the beginning of a boost attempt is entangled with a qubit-state dependent cavity 
state, which we approximate unconditionally by the average steady-state cavity state in DD. 
Nonetheless, as we shall show, the model still produces a quantitative behavior that agrees 
very well with the experimental results. From the above equations, it is easy to show that 
the average qubits state of the heralded trajectories after k boost attempts is given by 




(k) 

herald 


c{T{I-c)f 


(S.15) 


In the case of a pre-stabilization of sufficient number of correction steps (or duration), 
is given by the steady-state, Soo-, introduced in Supp. Sec. III. 

For each of the k boost attempts, the hrst entry in the vector i.e., 7r_, gives 

the hdelity to |0-). The Li norm of the numerator in the expression for gives the 

percentage of trajectories that have completed k boost attempts. Summing the percentages 
over all values of k from 0 to the maximum limit gives the overall success probability. In 



Fig. S.6, we show the results of the simulation using the model described here and compare 
them to the experimental results presented in Fig. 4 of the main text. Furthermore, from 
the simulation, we hnd that with the realistic system parameter improvement as specihed 
in Supp. Sec. Ill, the hdelity of heralded trajectories can be 90% and 95% for DD and MB, 
respectively, with order unity success probability. 


VI. MEASUREMENT-INDUCED AC STARK SHIFT AND CORRECTION FOR 
MB 

The quasi-parity measurement induces a deterministic phase shift between the two qubits 
due to measurement-induced AC Stark shift [2]. This is evidenced by examining the phase of 
the Bell state created in the measurement optimization experiment described in Supp. Sec. I, 
in which we varied the measurement duration. As the measurement duration is increased, 
the Bell angle of the hnal Bell state changes linearly (Fig. S.7a). In order for MB to work, 
we need to account for the deterministic phase shift induced by the measurement. This 
correction is accomplished by a rotation on Bob before the unitary Uq- Fig. S.7b 
gives one example of a sequence trajectory to illustrate how the correction works. With 
no loss of generality (and the reason will become clear soon in the discussion that follows), 
we construct Ue = i?“(7r/2) 0 R^_^^{'k/2) and Uq = i?“(7r/2) 0 i?^^(7r/2) (where 0o = 0 
corresponds to the X axis) such that |0o) = \ge) + 6**^° \eg) is the eigenstate of Uq and 
applying Ue on the even states results in l^o) with 50% probability after the quasi-parity 
measurement. Suppose that the qubits are in the ground states, is applied and the 
subsequent quasi-parity measurement gives p = —1. During the quasi-parity measurement, 
a deterministic phase shift of is added. Since the measurement reports odd, the next 
conditional unitary is Uq. The Z gate before Uq undoes the phase shift and recovers the 
eigenstate which Uq leaves unchanged. After another parity measurement, the qubits are in 
the state legf), with the phase shift added again. Consequently, we can see that 

the MB sequence actually stabilizes \ge)-\-e''^^°^^^'^ \eg) state. In practice, for our experiment, 
the Z rotation for Bob is constructed from a composite of X and Y rotations, such that 
R\{P) = Rx{^)Ry{0)Rx{—^)i and the effective correction angle 6 is swept (Fig. S.7c) to 
find the optimal value that cancels the deterministic phase shift and thus maximizes the 
hdelity. Furthermore, to make the target state of the stabilization |0_), the rotation axis 
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of the pulses on Bob in Uq and f/g is chosen such that 4>o + <Pd = tt. This correction for 
measurement-induced AC Stark shift is also done in the simulation for MB. 


VII. F STATE MEASUREMENT DURING NFP 

While we have been treating our two qubits as purely two-level systems, in reality there 
are higher energy levels, in particular the second excited level is expected to play a non- 
negligible role in the dynamics. We find that the equilibrium qubit population not in the 
\gg) state was about 15%, and the / state was also populated. To investigate whether the 
decrease of the hdelity in NFP as a function of boost attempts number is due to the role 
played by the /-state population, we measured the populations in \ fg), |/e), \gf), |e/), \ ff) 
after a given number of boost attempts in DD. The population in \fg) was measured by 
applying a n pulse on Alice’s e-f transition, then a n pulse on its e-g transition, followed 
by a measurement of the population in \gg). The other /-state populations are similarly 
obtained. The sum of these 5 populations gives the total /-state population plotted in 
Fig. S.8, which indeed increases as a function of boost attempt number. It is therefore 
plausible that the decrease in hdelity with respect to number of boost attempts is due to 
the system being trapped in other levels outside the correction space. This problem, which 
is common in similar atomic physics experiments, could be addressed by additional drives. 
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FIG. S.l. Setup of the experiment. The qubit-cavity system was placed at the base stage 
of a dilution refrigerator (Oxford Triton200) below 20 mK. On the input side: Two FPGAs 
(Innovative Integration X6-1000M) generated the pulse envelopes in I and Q quadratures 
to modulate the qubit drives at fAlice and /Bob frequencies (Vaunix Labbrick LMS-802) for 
Alice and Bob, respectively. The I/Q modulations were output by the FPGAs with one and 
two single-sideband modulations in MB and DD (so that both zero-photon and n-photon 
qubit frequencies were addressed), respectively. The microwave frequency drives and I/Q 
modulation were mixed by IQ mixers. Two Agilent N5183 microwave generators produced 
the cavity drives at fee and fgg respectively. The cavity drives were also pulsed by the 
FPGAs. On the output side: the transmitted signal through the cavity was directed by two 
circulators to the JPG for nearly quantum-limited amplihcation. It was further amplihed 
at the 3 K stage by a cryogenic HEMT ampliher. After additional room temperature am- 
plihcation, the signal was split into two interferometric setups for readouts at the fgg and 
fee frequencies, respectively. In each of the interferometers, the signal arriving from the 
fridge was mixed with a local oscillator set -|-50 MHz away to produce a down-converted 
signal at 50 MHz. A copy of the cavity drive that did not go through the fridge was also 
down-converted in the same manner to produce a reference. Finally, the two signals with 
their respective references were sent to the analog-to-digital-converters on the FPGA boards 
for digitization and further demodulated inside the FPGAs. The two FPGAs jointly esti¬ 
mate the qubits’ state by communicating their results with each other. Along the input 
and output lines, attenuators, low pass hlters and homemade Eccosorb hlters were placed at 
various stages to protect the qubits from thermal noise and undesired microwave and optical 
frequency radiation. Moreover, the cavity and JPG were shielded from stray magnetic helds 
by aluminum and cryogenic /r-metal (Amumetal A4K) shields. 
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measurement duration (ns) 


FIG. S.2. Experimental calibration of measurement strength and duration for MB. Fidelity to the 
state \4>+) = :^(|5'e) + l^d)) after preparing the qubits in a maximally superposed state followed 
by a quasi-parity measurement with post-selection. The fidelity is plotted as a function of mea¬ 
surement duration and shown for a set of measurement strengths. The optimal values chosen for 
the experiment are 660 ns and h = 4.5. 



FIG. S.3. (a) The cavity outputs at fgg and fee are integrated to obtain measurement outcomes 
Igg and lee, respectively. (b),(c), (d), (e) Histograms of the measurement outcomes Igg and lee 
for DD and MB, respectively. The leftmost dashed line (labeled and indicates a 

particular choice of threshold for heralding the measurement outcomes. Gounts to the left of both 
thresholds are declared success. The right dashed line {Igg^^^^ and indicates the threshold 

for quasi-parity measurement in MB. 
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FIG. S.4. Markov model of a correction step in MB. Transition between any two nodes is possible 
and is represented as a directional edge. The 16 possible transitions make up the 16 matrix elements 
in the stochastic transition matrix, T, corresponding to a correction step in MB. Numbers next 
to the edges represent the elements of T calculated for the current experiment parameters by a 
master equation simulation. 
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1 — c 



boosting 


FIG. S.5. Markov model of the NFP (nested feedback protocol) for real-time heralding. This 
model is an extension of the model introduced for MB. During a boost attempt, transitions occur 
inside the “boosting” node for further stabilization. The edge leading from the “boosting” to the 
“heralded” node represents the real-time heralding that selects some fraction of trajectories by the 
threshold-dependent matrix c (see text) at the end of a boost attempt. Those that are not selected 
(1 — c) enter into another boost attempt. 
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FIG. S.6. Simulation of real-time heralding by NFP. The left (right) panel presents the results for 
DD (MB), (a), (c) Cumulative success probability of having completed at most a given number of 
boost attempts before tomography for DD and MB, respectively. Green curve shows the exper¬ 
imental result as presented in the main text. Blue curve is the simulation result using the same 
experimental parameters, (b), (d) Fidelity to \(j)-) for DD and MB, respectively. Cyan dashed line 
denotes the unconditioned steady state fidelity obtained in the experiment. Green squares (blue 
circles) show the corresponding fidelity as a function of the number of boost attempts during NFP 
obtained in the experiment (simulation). 
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(a) measurement-induced phase shift (c) sweeping phase correction 
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FIG. S.7. Correcting for deterministic measurement-induced AC Stark shift (a) Deterministic 
phase shift induced by measurement as a function of measurement duration, (b) One example of 
a sequence trajectory illustrating the phase correction R\{9) at work. See detailed explanation in 
accompanying text, (c) Fidelity of the steady state to the target Bell state as a function of the 
correction angle of the effective Z gate on Bob. 
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FIG. S.8. Experimental measurement of / state population plotted as a function of the boost 
attempts number in NFP for DD. 
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